BAC-Pool Sequencing and Assembly of 19 Mb of the Complex Sugarcane Genome

نویسندگان

  • Vagner Katsumi Okura
  • Rafael S. C. de Souza
  • Susely F. de Siqueira Tada
  • Paulo Arruda
چکیده

Sequencing plant genomes are often challenging because of their complex architecture and high content of repetitive sequences. Sugarcane has one of the most complex genomes. It is highly polyploid, preserves intact homeologous chromosomes from its parental species and contains >55% repetitive sequences. Although bacterial artificial chromosome (BAC) libraries have emerged as an alternative for accessing the sugarcane genome, sequencing individual clones is laborious and expensive. Here, we present a strategy for sequencing and assembly reads produced from the DNA of pooled BAC clones. A set of 178 BAC clones, randomly sampled from the SP80-3280 sugarcane BAC library, was pooled and sequenced using the Illumina HiSeq2000 and PacBio platforms. A hybrid assembly strategy was used to generate 2,451 scaffolds comprising 19.2 MB of assembled genome sequence. Scaffolds of ≥20 Kb corresponded to 80% of the assembled sequences, and the full sequences of forty BACs were recovered in one or two contigs. Alignment of the BAC scaffolds with the chromosome sequences of sorghum showed a high degree of collinearity and gene order. The alignment of the BAC scaffolds to the 10 sorghum chromosomes suggests that the genome of the SP80-3280 sugarcane variety is ∼19% contracted in relation to the sorghum genome. In conclusion, our data show that sequencing pools composed of high numbers of BAC clones may help to construct a reference scaffold map of the sugarcane genome.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sequence-based physical mapping of complex genomes by whole genome profiling.

We present whole genome profiling (WGP), a novel next-generation sequencing-based physical mapping technology for construction of bacterial artificial chromosome (BAC) contigs of complex genomes, using Arabidopsis thaliana as an example. WGP leverages short read sequences derived from restriction fragments of two-dimensionally pooled BAC clones to generate sequence tags. These sequence tags are...

متن کامل

BAC-Pool Sequencing and Analysis of Large Segments of A12 and D12 Homoeologous Chromosomes in Upland Cotton

Although new and emerging next-generation sequencing (NGS) technologies have reduced sequencing costs significantly, much work remains to implement them for de novo sequencing of complex and highly repetitive genomes such as the tetraploid genome of Upland cotton (Gossypium hirsutum L.). Herein we report the results from implementing a novel, hybrid Sanger/454-based BAC-pool sequencing strategy...

متن کامل

BAC-pool sequencing and analysis confirms growth-associated QTLs in the Asian seabass genome

The Asian seabass is an important marine food fish that has been cultured for several decades in Asia Pacific. However, the lack of a high quality reference genome has hampered efforts to improve its selective breeding. A 3D BAC pool set generated in this study was screened using 22 SSR markers located on linkage group 2 which contains a growth-related QTL region. Seventy-two clones correspondi...

متن کامل

Bacterial artificial chromosome-based physical map of the rice genome constructed by restriction fingerprint analysis.

Genome-wide physical mapping with bacteria-based large-insert clones (e.g., BACs, PACs, and PBCs) promises to revolutionize genomics of large, complex genomes. To accelerate rice and other grass species genome research, we developed a genome-wide BAC-based map of the rice genome. The map consists of 298 BAC contigs and covers 419 Mb of the 430-Mb rice genome. Subsequent analysis indicated that ...

متن کامل

Long Read Sequencing Technology to Solve Complex Genomic Regions Assembly in Plants

During the last decade, we have observed remarkable advances in sequencing technology and bioinformatics analysis. The turning point came when the pyrosequencing technologies became available for the scientific community. Following Sanger’s method, pyrosequencing has provided a massive increase in sequencing throughput combined with a huge decrease in the cost per sequenced base. Thus, it becam...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Frontiers in plant science

دوره 7  شماره 

صفحات  -

تاریخ انتشار 2016